CDS

Accession Number TCMCG024C29151
gbkey CDS
Protein Id XP_022005484.1
Location complement(join(134162863..134162988,134163062..134163169,134163247..134163490,134163571..134163806,134163927..134164117,134164202..134164436,134165185..134165403))
Gene LOC110903986
GeneID 110903986
Organism Helianthus annuus
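
The Location field above lists seven exon segments on the complementary strand. As a rough consistency check, and not part of the original record, the segment lengths can be summed and compared with the protein length reported below. This is a minimal sketch: the helper name `exon_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(134162863..134162988,134163062..134163169,"
    "134163247..134163490,134163571..134163806,134163927..134164117,"
    "134164202..134164436,134165185..134165403))"
)


def exon_lengths(location: str) -> list[int]:
    """Return the length of each start..end segment (1-based, inclusive)."""
    spans = re.findall(r"(\d+)\.\.(\d+)", location)
    return [int(end) - int(start) + 1 for start, end in spans]


lengths = exon_lengths(LOCATION)
cds_length = sum(lengths)

# Assumption: the last codon is a stop codon, so it adds no residue.
protein_length = cds_length // 3 - 1

print(lengths)         # [126, 108, 244, 236, 191, 235, 219]
print(cds_length)      # 1359
print(protein_length)  # 452
```

Under these assumptions the segments total 1359 nt, that is 453 codons including the stop, which agrees with the 452 aa length given in the Protein section.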

Protein

Length 452aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022149792.2
Definition formamidase isoform X1 [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category C
Description formamidase C869.04 isoform X1
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00524
KEGG_rclass RC02432, RC02810
BRITE ko00000, ko00001, ko01000
KEGG_ko ko:K01455
EC 3.5.1.49
KEGG_Pathway ko00460, ko00630, ko00910, ko01200, map00460, map00630, map00910, map01200
GOs -

Sequence

CDS:  
ATGGCTCAACATGGTCCTAGACTGGTGGTGCCAATAGACGTAACCAAGAAACCAAGGGAACAGAAGCTTCCGCTTCATAACCGGTGGCACCCTGACATACCACCCGTTGCTGAGGTTCGTGTCGGGGAGGTGTTTCGGGTCGAGATGGTTGATTTTTCTGGTGGTGGTATCACTAAAGAATACACTGCTGAAGACATCAAATTCTCTGACCAATTTGTTGTGCATTATCTGAGTGGGCCAATTAGAGTTGTTGATGAGGATGGACCGGCTAAACCAGGCGATCTTCTTGCGGTTGAAATATGCAATTTGGGTGCTCTTCCTGGTGATGAATGGGGTTTTACTGCTATTTTTGATAGAGAAAATGGTGGTGGGTTCCTTACTGATCATTTCCCCTGTGCCACAAAAGCAATTTGGTATTTTGAAGGAATATATGCTTACTCTCCTCATATTCCAGGTGTACGGTTTCCGGGTTTAACACACCCGGGAATAATCGGAACAGCGCCTTCAATGGAGCTCCTTAATATATGGAATGAAAGGGAGAGAGAGCTTGAAGAAAATGGCTTAAAATCTTTAAAGTTATGTGAAGTTTTGCATTCAAGACCATTGGCAAACCTGCCTTCAACAAAAGGTTGTCTCCTCGGCAAGATTGAGGAAGGAAGTCGCGAATGGGAAAAGATGGCTAACGAGGCTGCAAGGACGATTCCGGGAAGGGAAAATGGCGGGAACTGTGACATCAAGAATCTAAGTAGAGGTTCAAAGATATACCTTCCAGTGTTTGTGGAAGGGGCTAACTTTAGTACTGGAGATATGCATTTTTCACAAGGAGATGGTGAAGTTTCCTTTTGTGGGGCCATTGAGATGAGTGGCTTCCTTGAGCTCAAGTGTGAGATAATAAGAGGAGGGATGAAAGAATATCTAACTCCAATGGGGCCTACTCCTCTTCATGTTAATCCTATATTCGAGATCGGGCCAGTAGAGCCCAGATTCTCAGAATGGTTAGTATTTGAGGGAATCAGTGTTGATGAGAGTGGAAGACAACATTACCTTGACGCCAGTGTTGCTTACAAGCGAGCCGTCCTAAATGCAATCGACTACCTGTCCAAATTTGGATATTCCAAGGAACAGGTGTATCTTTTATTGTCATGTTGTCCTTGCGAAGGAAGGATTTCAGGAATAGTTGATGCTCCAAATGCTGTCGCCACACTTGCAATTCCAACTGCTATATTTGATCAGGATATTCGCCCAAAGGCAAACAAGTTGCCAATAGGACCACGCGTTGTCAGGAATCCAGATATCCCAAGATGCACTTATGATGGAAATTTACCGATCACAAAGAACCTGAGTGCAACAGGAAGTTAA
Protein:  
MAQHGPRLVVPIDVTKKPREQKLPLHNRWHPDIPPVAEVRVGEVFRVEMVDFSGGGITKEYTAEDIKFSDQFVVHYLSGPIRVVDEDGPAKPGDLLAVEICNLGALPGDEWGFTAIFDRENGGGFLTDHFPCATKAIWYFEGIYAYSPHIPGVRFPGLTHPGIIGTAPSMELLNIWNERERELEENGLKSLKLCEVLHSRPLANLPSTKGCLLGKIEEGSREWEKMANEAARTIPGRENGGNCDIKNLSRGSKIYLPVFVEGANFSTGDMHFSQGDGEVSFCGAIEMSGFLELKCEIIRGGMKEYLTPMGPTPLHVNPIFEIGPVEPRFSEWLVFEGISVDESGRQHYLDASVAYKRAVLNAIDYLSKFGYSKEQVYLLLSCCPCEGRISGIVDAPNAVATLAIPTAIFDQDIRPKANKLPIGPRVVRNPDIPRCTYDGNLPITKNLSATGS